Metadata-augmented multilevel Bayesian network framework for image sensor fusion and main subject detection
نویسندگان
چکیده
Automatic main subject detection refers to the problem of determining salient or interesting objects in a photograph. We have used a multilevel Bayesian network-based approach for solving this problem in the unconstrained domain of consumer photographs. In our previous work, we described building an evidential reasoning and image sensor fusion framework that uses a number of low-level and high-level image sensors to determine the intended objects-of-interest in any target image. In this paper, we will describe recent work in adding metadata-augmented reasoning processes to this framework. Many image capture devices, e.g., digital cameras, record scene metadata along with the image. This metadata can contain useful information such as whether the flash was used, orientation of the image, focal range, etc. In addition, other metadata such as indoor-outdoor, orientation, and urban-rural classification can be generated using image understanding algorithms, or user annotation. We present a Bayesian network-based approach that accurately models the system and allows for metadataaugmented sensor integration in an evidential framework. The system seamlessly operates in the absence or presence of metadata with no user intervention required. We present subjective and analytical results that show the performance improvements achieved when scene metadata-augmented reasoning processes are used.
منابع مشابه
A New Fault Tolerant Nonlinear Model Predictive Controller Incorporating an UKF-Based Centralized Measurement Fusion Scheme
A new Fault Tolerant Controller (FTC) has been presented in this research by integrating a Fault Detection and Diagnosis (FDD) mechanism in a nonlinear model predictive controller framework. The proposed FDD utilizes a Multi-Sensor Data Fusion (MSDF) methodology to enhance its reliability and estimation accuracy. An augmented state-vector model is developed to incorporate the occurred senso...
متن کاملMultimodal Speaker Detection Using Input/Output Dynamic Bayesian Networks
Inferring users’ actions and intentions forms an integral part of design and development of any human-computer interface. The presence of noisy and at times ambiguous sensory data makes this problem challenging. We formulate a framework for temporal fusion of multiple sensors using input–output dynamic Bayesian networks (IODBNs). We find that contextual information about the state of the comput...
متن کاملMHIDCA: Multi Level Hybrid Intrusion Detection and Continuous Authentication for MANET Security
Mobile ad-hoc networks have attracted a great deal of attentions over the past few years. Considering their applications, the security issue has a great significance in them. Security scheme utilization that includes prevention and detection has the worth of consideration. In this paper, a method is presented that includes a multi-level security scheme to identify intrusion by sensors and authe...
متن کاملAn efficient method for cloud detection based on the feature-level fusion of Landsat-8 OLI spectral bands in deep convolutional neural network
Cloud segmentation is a critical pre-processing step for any multi-spectral satellite image application. In particular, disaster-related applications e.g., flood monitoring or rapid damage mapping, which are highly time and data-critical, require methods that produce accurate cloud masks in a short time while being able to adapt to large variations in the target domain (induced by atmospheric c...
متن کاملVision-Based Speaker Detection Using Bayesian Networks
The development of user interfaces based on vision and speech requires the solution of a challenging statistical inference problem: The intentions and actions of multiple individuals must be inferred from noisy and ambiguous data. We argue that Bayesian network models are an attractive statistical framework for cue fusion in these applications. Bayes nets combine a natural mechanism for express...
متن کامل